智能论文笔记

Hierarchical Federated Learning with Privacy

Varun Chandrasekaran , Suman Banerjee , Diego Perino , Nicolas Kourtellis

分类：机器学习

2022-06-10

联合学习（FL），数据保留在联合客户端，并且仅与中央聚合器共享梯度更新是私人的。最近的工作表明，具有梯度级别访问权限的对手可以成功进行推理和重建攻击。在这种情况下，众所周知，差异化（DP）学习可以提供弹性。但是，现状中使用的方法（\ ie中央和本地DP）引入了不同的公用事业与隐私权衡权衡。在这项工作中，我们迈出了通过{\ em层次fl（HFL）}来缓解此类权衡的第一步。我们证明，通过引入一个新的中介层，可以添加校准的DP噪声，可以获得更好的隐私与公用事业权衡；我们称此{\ em层次结构DP（HDP）}。我们使用3个不同数据集的实验（通常用作FL的基准）表明HDP产生的模型与使用中央DP获得的模型一样准确，在中央聚集器处添加了噪声。这种方法还为推理对手提供了可比的好处，例如在本地DP案例中，在联合客户端添加了噪音。

translated by 谷歌翻译

Few-Shot Domain Adaptation For End-to-End Communication

Jayaram Raghuram , Yijing Zeng , Dolores García Martí , Rafael Ruiz Ortiz , Somesh Jha , Joerg Widmer , Suman Banerjee

分类：机器学习 | (统计)机器学习

2021-08-02

最近，使用自动编码器（由使用神经网络建模的编码器，渠道和解码器组成）的通信系统的端到端学习问题最近被证明是一种有希望的方法。实际采用这种学习方法面临的挑战是，在变化的渠道条件（例如无线链接）下，它需要经常对自动编码器进行重新训练，以保持低解码错误率。由于重新培训既耗时又需要大量样本，因此当通道分布迅速变化时，它变得不切实际。我们建议使用不更改编码器和解码器网络的快速和样本（几射击）域的适应方法来解决此问题。不同于常规的训练时间无监督或半监督域的适应性，在这里，我们有一个训练有素的自动编码器，来自源分布，我们希望（在测试时间）使用仅使用一个小标记的数据集和无标记的数据来适应（测试时间）到目标分布。我们的方法着重于基于高斯混合物网络的通道模型，并根据类和组件条件仿射变换制定其适应性。学习的仿射转换用于设计解码器的最佳输入转换以补偿分布变化，并有效地呈现在接近源分布的解码器输入中。在实际MMWAVE FPGA设置以及无线设置共有的许多模拟分布变化上，使用非常少量的目标域样本来证明我们方法在适应时的有效性。

translated by 谷歌翻译

A System-Level View on Out-of-Distribution Data in Robotics

Rohan Sinha , Apoorva Sharma , Somrita Banerjee , Thomas Lew , Rachel Luo , Spencer M. Richards , Yixiao Sun , Edward Schmerling , Marco Pavone

分类：机器人 | 机器学习

2022-12-28

When testing conditions differ from those represented in training data, so-called out-of-distribution (OOD) inputs can mar the reliability of black-box learned components in the modern robot autonomy stack. Therefore, coping with OOD data is an important challenge on the path towards trustworthy learning-enabled open-world autonomy. In this paper, we aim to demystify the topic of OOD data and its associated challenges in the context of data-driven robotic systems, drawing connections to emerging paradigms in the ML community that study the effect of OOD data on learned models in isolation. We argue that as roboticists, we should reason about the overall system-level competence of a robot as it performs tasks in OOD conditions. We highlight key research questions around this system-level view of OOD problems to guide future research toward safe and reliable learning-enabled autonomy.

translated by 谷歌翻译

GraphCast: Learning skillful medium-range global weather forecasting

Remi Lam , Alvaro Sanchez-Gonzalez , Matthew Willson , Peter Wirnsberger , Meire Fortunato , Alexander Pritzel , Suman Ravuri , Timo Ewalds , Ferran Alet , Zach Eaton-Rosen

分类：机器学习

2022-12-24

We introduce a machine-learning (ML)-based weather simulator--called "GraphCast"--which outperforms the most accurate deterministic operational medium-range weather forecasting system in the world, as well as all previous ML baselines. GraphCast is an autoregressive model, based on graph neural networks and a novel high-resolution multi-scale mesh representation, which we trained on historical weather data from the European Centre for Medium-Range Weather Forecasts (ECMWF)'s ERA5 reanalysis archive. It can make 10-day forecasts, at 6-hour time intervals, of five surface variables and six atmospheric variables, each at 37 vertical pressure levels, on a 0.25-degree latitude-longitude grid, which corresponds to roughly 25 x 25 kilometer resolution at the equator. Our results show GraphCast is more accurate than ECMWF's deterministic operational forecasting system, HRES, on 90.0% of the 2760 variable and lead time combinations we evaluated. GraphCast also outperforms the most accurate previous ML-based weather forecasting model on 99.2% of the 252 targets it reported. GraphCast can generate a 10-day forecast (35 gigabytes of data) in under 60 seconds on Cloud TPU v4 hardware. Unlike traditional forecasting methods, ML-based forecasting scales well with data: by training on bigger, higher quality, and more recent data, the skill of the forecasts can improve. Together these results represent a key step forward in complementing and improving weather modeling with ML, open new opportunities for fast, accurate forecasting, and help realize the promise of ML-based simulation in the physical sciences.

translated by 谷歌翻译

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Yuting Guo , Swati Rajwal , Sahithi Lakamana , Chia-Chun Chiang , Paul C. Menell , Adnan H. Shahid , Yi-Chieh Chen , Nikita Chhabra , Wan-Ju Chao , Chieh-Ju Chao

分类：自然语言处理

2022-12-23

Migraine is a high-prevalence and disabling neurological disorder. However, information migraine management in real-world settings could be limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text classification system for automatically detecting self-reported migraine-related posts, and (iii) conduct analyses of the self-reported posts to assess the utility of social media for studying this problem. We manually annotated 5750 Twitter posts and 302 Reddit posts. Our system achieved an F1 score of 0.90 on Twitter and 0.93 on Reddit. Analysis of information posted by our 'migraine cohort' revealed the presence of a plethora of relevant information about migraine therapies and patient sentiments associated with them. Our study forms the foundation for conducting an in-depth analysis of migraine-related information using social media data.

translated by 谷歌翻译

Scalable Hybrid Learning Techniques for Scientific Data Compression

Tania Banerjee , Jong Choi , Jaemoon Lee , Qian Gong , Jieyang Chen , Scott Klasky , Anand Rangarajan , Sanjay Ranka

分类：机器学习

2022-12-21

Data compression is becoming critical for storing scientific data because many scientific applications need to store large amounts of data and post process this data for scientific discovery. Unlike image and video compression algorithms that limit errors to primary data, scientists require compression techniques that accurately preserve derived quantities of interest (QoIs). This paper presents a physics-informed compression technique implemented as an end-to-end, scalable, GPU-based pipeline for data compression that addresses this requirement. Our hybrid compression technique combines machine learning techniques and standard compression methods. Specifically, we combine an autoencoder, an error-bounded lossy compressor to provide guarantees on raw data error, and a constraint satisfaction post-processing step to preserve the QoIs within a minimal error (generally less than floating point error). The effectiveness of the data compression pipeline is demonstrated by compressing nuclear fusion simulation data generated by a large-scale fusion code, XGC, which produces hundreds of terabytes of data in a single day. Our approach works within the ADIOS framework and results in compression by a factor of more than 150 while requiring only a few percent of the computational resources necessary for generating the data, making the overall approach highly effective for practical scenarios.

translated by 谷歌翻译

Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task

Shailaja Keyur Sampat , Pratyay Banerjee , Yezhou Yang , Chitta Baral

分类：计算机视觉

2022-12-07

'Actions' play a vital role in how humans interact with the world. Thus, autonomous agents that would assist us in everyday tasks also require the capability to perform 'Reasoning about Actions & Change' (RAC). This has been an important research direction in Artificial Intelligence (AI) in general, but the study of RAC with visual and linguistic inputs is relatively recent. The CLEVR_HYP (Sampat et. al., 2021) is one such testbed for hypothetical vision-language reasoning with actions as the key focus. In this work, we propose a novel learning strategy that can improve reasoning about the effects of actions. We implement an encoder-decoder architecture to learn the representation of actions as vectors. We combine the aforementioned encoder-decoder architecture with existing modality parsers and a scene graph question answering model to evaluate our proposed system on the CLEVR_HYP dataset. We conduct thorough experiments to demonstrate the effectiveness of our proposed approach and discuss its advantages over previous baselines in terms of performance, data efficiency, and generalization capability.

translated by 谷歌翻译

Learning Action-Effect Dynamics from Pairs of Scene-graphs

Shailaja Keyur Sampat , Pratyay Banerjee , Yezhou Yang , Chitta Baral

分类：计算机视觉

2022-12-07

'Actions' play a vital role in how humans interact with the world. Thus, autonomous agents that would assist us in everyday tasks also require the capability to perform 'Reasoning about Actions & Change' (RAC). Recently, there has been growing interest in the study of RAC with visual and linguistic inputs. Graphs are often used to represent semantic structure of the visual content (i.e. objects, their attributes and relationships among objects), commonly referred to as scene-graphs. In this work, we propose a novel method that leverages scene-graph representation of images to reason about the effects of actions described in natural language. We experiment with existing CLEVR_HYP (Sampat et. al, 2021) dataset and show that our proposed approach is effective in terms of performance, data efficiency, and generalization capability compared to existing models.

translated by 谷歌翻译

Generalised Spherical Text Embedding

Souvik Banerjee , Bamdev Mishra , Pratik Jawanpuria , Manish Shrivastava

分类：自然语言处理

2022-11-30

This paper aims to provide an unsupervised modelling approach that allows for a more flexible representation of text embeddings. It jointly encodes the words and the paragraphs as individual matrices of arbitrary column dimension with unit Frobenius norm. The representation is also linguistically motivated with the introduction of a novel similarity metric. The proposed modelling and the novel similarity metric exploits the matrix structure of embeddings. We then go on to show that the same matrices can be reshaped into vectors of unit norm and transform our problem into an optimization problem over the spherical manifold. We exploit manifold optimization to efficiently train the matrix embeddings. We also quantitatively verify the quality of our text embeddings by showing that they demonstrate improved results in document classification, document clustering, and semantic textual similarity benchmark tests.

translated by 谷歌翻译

Machine learning emulation of a local-scale UK climate model

Henry Addison , Elizabeth Kendon , Suman Ravuri , Laurence Aitchison , Peter AG Watson

分类：机器学习

2022-11-29

Climate change is causing the intensification of rainfall extremes. Precipitation projections with high spatial resolution are important for society to prepare for these changes, e.g. to model flooding impacts. Physics-based simulations for creating such projections are very computationally expensive. This work demonstrates the effectiveness of diffusion models, a form of deep generative models, for generating much more cheaply realistic high resolution rainfall samples for the UK conditioned on data from a low resolution simulation. We show for the first time a machine learning model that is able to produce realistic samples of high-resolution rainfall based on a physical model that resolves atmospheric convection, a key process behind extreme rainfall. By adding self-learnt, location-specific information to low resolution relative vorticity, quantiles and time-mean of the samples match well their counterparts from the high-resolution simulation.

translated by 谷歌翻译